Adaptive Orthogonal Projection for Batch and Online Continual Learning
Abstract
Catastrophic forgetting is a key obstacle to continual learning. One of the state-of-the-art approaches is orthogonal projection. The idea of this approach is to learn each task by updating the network parameters or weights only in the direction orthogonal to the subspace spanned by all previous task inputs. This ensures no interference with tasks that have already been learned. The system OWM, which uses this idea, performs very well against other state-of-the-art systems. In this paper, we first discuss an issue that we discovered in the mathematical derivation of this approach and then propose a novel method, called AOP (Adaptive Orthogonal Projection), to resolve it. AOP yields significant accuracy gains in empirical evaluations in both the batch and online continual learning settings, without saving any training data as replay-based methods do.
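The general orthogonal-projection idea described in the abstract can be illustrated with a short NumPy sketch. This is neither the OWM implementation nor AOP: it assumes a single linear layer, builds the projector in closed form from a small batch of stored inputs (an illustrative simplification), and uses hypothetical names (`orthogonal_projector`, `projected_step`, `alpha`, `lr`) that do not come from the paper.

```python
import numpy as np

def orthogonal_projector(prev_inputs, alpha=1e-6):
    """Return P = I - A^T (A A^T + alpha*I)^{-1} A, where the rows of A
    are inputs from earlier tasks. alpha is a small regularizer
    (an assumption of this sketch, not a value from the paper)."""
    A = np.asarray(prev_inputs, dtype=float)      # shape (n_samples, dim)
    n, d = A.shape
    gram = A @ A.T + alpha * np.eye(n)            # regularized Gram matrix
    return np.eye(d) - A.T @ np.linalg.solve(gram, A)

def projected_step(W, grad, P, lr=0.1):
    """One gradient step restricted to directions (approximately)
    orthogonal to the span of the previous inputs.
    W has shape (dim_in, dim_out); layer outputs are x @ W."""
    return W - lr * (P @ grad)

rng = np.random.default_rng(0)
old_inputs = rng.normal(size=(3, 8))   # inputs seen in earlier tasks
W = rng.normal(size=(8, 4))            # weights of one linear layer
grad = rng.normal(size=(8, 4))         # gradient from the new task

P = orthogonal_projector(old_inputs)
W_new = projected_step(W, grad, P)

# Outputs on the old inputs should be almost unchanged by the update.
drift = np.abs(old_inputs @ W_new - old_inputs @ W).max()
change = np.abs(W_new - W).max()
```

Here `drift` measures how much the layer's outputs on earlier-task inputs move after the update, while `change` confirms that the weights were still updated for the new task. In this sketch the drift is bounded by the size of `alpha`, which is why a small value is used.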
Similar resources
Projection-free Online Learning
The computational bottleneck in applying online learning to massive data sets is usually the projection step. We present efficient online learning algorithms that eschew projections in favor of much more efficient linear optimization steps using the Frank-Wolfe technique. We obtain a range of regret bounds for online convex optimization, with better bounds for specific cases such as stochastic ...
Online and Batch Learning Algorithms for Data with Missing Features
We introduce new online and batch algorithms that are robust to data with missing features, a situation that arises in many practical applications. In the online setup, we allow for the comparison hypothesis to change as a function of the subset of features that is observed on any given round, extending the standard setting where the comparison hypothesis is fixed throughout. In the batch setup...
Optimal Tuning of Continual Online Exploration in Reinforcement Learning
This paper presents a framework for tuning continual exploration in an optimal way. It first quantifies the rate of exploration by defining the degree of exploration of a state as the probability-distribution entropy for choosing an admissible action. Then, the exploration/exploitation tradeoff is stated as a global optimization problem: find the exploration strategy that minimizes the ex...
Multi Kernel Learning with Online-Batch Optimization
In recent years there has been a lot of interest in designing principled classification algorithms over multiple cues, based on the intuitive notion that using more features should lead to better performance. In the domain of kernel methods, a principled way to use multiple features is the Multi Kernel Learning (MKL) approach. Here we present an MKL optimization algorithm based on stochastic gra...
From Batch to Transductive Online Learning
It is well known that everything that is learnable in the difficult online setting, where an arbitrary sequence of examples must be labeled one at a time, is also learnable in the batch setting, where examples are drawn independently from a distribution. We show a result in the opposite direction. We give an efficient conversion algorithm from batch to online that is transductive: it uses futu...
Journal
Journal title: Proceedings of the ... AAAI Conference on Artificial Intelligence
Year: 2022
ISSN: 2159-5399, 2374-3468
DOI: https://doi.org/10.1609/aaai.v36i6.20634